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^ ■ Abstract 
< 

• The diversity-multiplexing tradeoff of the dynamic decode-and-forward protocol is characterized for the half- 

duplex three-terminal (m, k, n)-relay channel where the source, relay and the destination terminals have m, k and 
n antennas, respectively. It is obtained as a solution to a simple, two-variable, convex optimization problem and 
this problem is solved in closed form for special classes of relay channels, namely, the (1, k, 1) relay channel, the 
, (n, l,n) relay channel and the (2, k, 2) relay channel. Moreover, the tradeoff curves for a certain class of relay 

channels, such as the (m, k,n > k) channels, are identical to those for the decode-and-forward protocol for the full 
duplex channel while for other classes of channels they are marginally lower at high multiplexing gains. Our results 
ly-^ ' also show that for some classes of relay channels and at low multiplexing gains the diversity orders of the dynamic 

decode-and-forward protocol protocol are greater than those of the static compress-and-forward protocol which in 
turn is known to be tradeoff optimal over all static half duplex protocols. In general, the dynamic decode-and-forward 
00 , protocol has a performance that is comparable to that of the static compress-and-forward protocol which, unlike the 

dynamic decode-and-forward protocol, requires global channel state information at the relay node. Its performance 
is also close to that of the decode-and-forward protocol over the full-duplex relay channel thereby indicating that the 
half-duplex constraint can be compensated for by the dynamic operation of the relay wherein the relay switches from 
the receive to the transmit mode based on the source-relay channel quality. 
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I. Introduction 

Higher transmission rates and increased reliability or quality-of-service are two of the most important goals in the 
design of wireless communication systems. Techniques enabling the simultaneous realization of higher transmission 
rates and reliability include the employment of multiple antennas at the receiver and the transmitter and cooperation 
or relaying among users of the network. In this paper, our interest is on a three-terminal relay network with the source, 
relay and destination each equipped with multiple, and possibly distinct, number of antennas. One application that 
is being considered for relaying, for example, is the potential for expanded throughput and coverage for broadband 
wireless access (IEEE 802.16) with rapid and low cost deployment of relay stations of complexity and cost lower 
than that of legacy base stations but higher than that of mobile stations 0. 

Communication theoretic results on relaying in wireless channels can be found in 0, 0, 0, 0, J6), Q, 
0, for ergodic 0, and outage settings 0, 0, 0, 0, 0, with 0, 0, 0, 0, characterizing 
the diversity-multiplexing tradeoff (DMT), a high SNR metric originally proposed for the multiple-input, multiple- 
output (MIMO) Rayleigh fading point-to-point links in iflOl . of several increasingly high performance half-duplex 
(HD) relaying protocols for single antenna terminals in 0, 0, Q, 0, and for the case of a multiple-antenna 
destination in 0. Of these protocols, the one that concerns us in this work is the so-called dynamic decode-and- 
forward (DDF) protocol of [8| but in the much more general context of a relay network with multiple and arbitrary 
number of antennas at each of the three nodes. The word "dynamic" in dynamic decode-and-forward highlights 
the feature of this protocol wherein the relay listens for a source-to-relay-channel-dependent fraction of a frame 
before deciding to transmit to the destination. Protocols wherein the relay listens for an a priori fixed fraction of 
the frame length are called static protocols. 

Relay networks with multiple antenna nodes were first considered in ifTTl where the authors analyzed the 
performance of a number of cooperative protocols and showed that the compress-and-forward (CF) protocol attains 
the fundamental DMT of both the full-duplex (FD) and the static half-duplex relay networks. Our choice of the 
DDF protocol however, is based on the fact that the CF protocol requires that the relay have perfect and global 
channel knowledge (i.e., the channel matrices between each of the three pairs of nodes) which may be difficult or 
even impossible to realize in practice. Moreover, while practical finite-length coding/decoding schemes based on 
the DDF protocol (cf. Ifl2l and the references therein) exist, no corresponding code has been found - to the best 
of our knowledge - for the CF protocol. In contrast to the CF protocol, the DDF protocol requires the relay node 
to merely know its incoming channel. This suggests a possible performance-complexity tradeoff between the CF 
and DDF protocols which can be illuminated in the high SNR regime by providing the DMT achievable by both 
these protocols on the half duplex MIMO relay channel. While IfTTl proved the optimality of the static CF (SCF) 
protocol on the half-duplex relay channel, it did not provide an explicit DMT of the half-duplex channel under 
the constraint on protocols being static. More recently, explicit DMT characterizations of the SCF protocol and 
the DDF protocol for the so-called symmetric half-duplex MIMO relay channel, in which the number of antennas 
at the source and destination are equal, were reported in [13] and by the authors in a conference version of this 
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paper in fl4l . respectively. Moreover, the work on the DDF protocol in lfT4l was generalized to the relay channel 
with an arbitrary number of antennas at the three nodes in a second conference version of this paper in (T5|, and 
independently and at almost the same time, the authors of fOI obtained a similar generalization of their work on the 
SCF protocol in lfl6l . Furthermore, there is one key enabling analytical tool that is common to both works, namely 
the specification of the joint distribution of the eigenvalues of two specially correlated Wishart matrices, but the 
methods employed to solve this problem are different in lfl6l and in this paper (first reported in 11151 ), Moreover, the 
dynamic nature of the DDF protocol considered in this work introduces another source of difficulty in the analysis 
that is not encountered in the analysis of the SCF protocol in |fl6| . Note that the generalization to the relay channel 
with an arbitrary number of nodes at the source, relay and destination is not only mathematically interesting but it 
is also a practically important problem. For example, this extra generality is critical in the application of relaying 
in broadband wireless access where the three nodes are envisioned to have unequal number of antennas and 
computational capability. Other potential practical examples of cooperative networks that involve terminals with 
different numbers of antennas are also detailed in Section [VI] along with a comparative DMT performance of the 
DDF protocol with non-cooperative communication as well as with full-duplex DF (FD-DF) and the SCF protocols. 

As stated earlier, it was found in ifTTI that the SCF protocol is DMT optimal on a relay channel under the constraint 
that the relay node operates statically. This doesn't of course preclude the DDF protocol from outperforming the 
SCF protocol since in the DDF protocol the relay operates in the dynamic mode. Indeed, comparing the DMT 
curves of the DDF protocol with that of the SCF protocol, it is found that for some channel configurations and 
at lower multiplexing gains, the DDF protocol does in fact achieve higher diversity orders than the SCF protocol. 
This proves that a half-duplex relay node operating via a static protocol prevents optimal performance over the HD 
channel. That the DDF protocol does not always perform uniformly better than the SCF protocol can be explained 
from the fact that the DF strategy is itself in general sub-optimal for static half-duplex and full duplex relaying ATI . 
While allowing dynamic operation improves the DF strategy in low multiplexing gain regimes sometimes beyond 
even that of the SCF protocol, the superior performance of the SCF protocol over its DF counterpart persists in 
spite of allowing dynamic operation for high multiplexing gains. While performance improvement over the DDF 
protocol was sought within the framework of dynamic operation of the relay and the decode-and-forward strategy 
in (9) for a relay channel with single antennas nodes, we do not pursue this improvement in this paper for the case 
of multiple antenna nodes. 

Comparison with the DMT performance of the full duplex decode-and-forward (FD-DF) protocol also reveals 
an interesting fact. For a number of cases depending on the relative numbers of antennas at the three nodes, the 
optimal DMTs of the FD-DF and the DDF protocol can be nearly equal. In these cases therefore, the extra cost of 
full duplex relaying (due to enabling simultaneous transmission and reception) can be completely offset relative to 
half-duplex relaying by allowing dynamic operation. 

It is also noteworthy that the application of the DDF protocol is not only limited to the relay channel. In IfTTI . 
it was shown that the DDF protocol is optimal on both a relay channel with automatic-retransmission-request 
(ARQ) protocol and a multiple-access-channel with a relay (MAR) and ARQ, with single antenna nodes. This 
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encourages one to further analyze the performance of the protocol on these channels with multiple antenna nodes. 
The performance analysis of the DDF protocol on a MIMO half-duplex three node relay channel can be seen to 
provide the first step in that direction. 

The rest of the paper is organized as follows. In Section HU we describe the system model and the DDF protocol. 
In Section [TTTJ we provide the eigenvalue distribution result using which, in Section IIVI we derive the outage 
probability of the DDF protocol and specify the optimization problem whose solution is its DMT. In Section [Vl 
closed-form solutions for three simple channel configurations are provided, following which explicit DMT curves 
are provided using these methods for a few more channel configurations in Section [VI] Section I VIII concludes the 
paper. 

Notations: (x) + , x Ay, \X\ \X\ and (Xy represent max{0,x}, the minimum of x and y, the size of the set 
X, the determinant, and the conjugate transpose of the matrix, X, respectively. Let M. and C denote the real and 
complex number fields, respectively, and C nxm the set of all n X m matrices with complex entries. The interval 
containing all real numbers between x and y will be denoted by [x,y], i.e., [x,y] = {z e K : x < z < y}. 
Similarly we denote the set {z e K : x < z < y} by (x,y]. The empty set is denoted by $. Let 1 
represent a matrix in C Ml xNl , where a,^ represents the element in the i th row and j th column. If Xi,X2, • • • ,x u 
represents a set of real numbers then x represents the vector whose components are a;,s, i.e., x = [xi, X2, • • • , x u ]. 
The Vandermonde matrix [ir 1 formed from the vector x = [xi,x%,- ■ ■ ,x u ] will be denoted by Vi(S). 

The probability distribution of a complex Gaussian random variable with zero mean and unit variance is denoted 
by CAf(Q, 1). The symbol diag(.) represents a square diagonal matrix of corresponding size with the elements in 
its argument on the diagonal and I n denotes annxn identity matrix. The probability of an event £ is denoted as 
Pr(£). All the logarithms in this text are to the base 2. Finally, any two functions /(/?) and g(p) of p, where p is 
the signal-to-noise ratio (SNR) defined later, are said to be exponentially equal and denoted as f(p)=g(p) if, 

lim !gim = lim lQ S^ , 

p^oo log(p) p->oo log(p) 

< and > signs are defined similarly. We also define the following function 

0, if x < y; 

f(x,y) = \ (1) 
+00, if x > y. 



II. System Model 

Consider a quasi-static Rayleigh faded MIMO relay channel with a single relay node as shown in Figure Q] 
where the source, the destination and the relay node have m, n and k antennas, respectively. Let Hsr £ C fexm , 
Hsd € C nxm and Hrd e C nxk represent the channel matrices between source and relay, source and destination 
and relay and destination, respectively. For economy of notation these channel matrices will be denoted by H 
collectively, i.e., H = {Hsr, Hgjj, Hrd}. The quasi-static fading assumption implies that these channel coefficient 
matrices remain fixed for the entire duration of a codeword and change independently from one codeword to the next. 
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Fig. 1. System model of the [m, k, n) MIMO relay channel. 

All these matrices are assumed to be mutually independent and the elements of these matrices are independently 
and identically distributed (i.i.d.) as CAf(0, 1), thus modeling Rayleigh fading. Let the channel state information 
be perfectly known at the receivers but unknown at the transmitters. Suppose that independent random Gaussian 
codes are used by both the source and the relay node. 

A. The DDF protocol 

The DDF protocol was proposed and analyzed in J8) for an HD relay channel with single-antenna nodes. In 
this protocol, the relay node has two phases of operation. In the first phase, the relay node listens to the source 
transmission and decodes it as soon it receives enough mutual information to do so. In particular, if I is the minimum 
integer such that / log(dct(/„ + pHsR^ Hsr)) > IR, where R is the rate of transmission in bits per channel use, p 
is the signal-to-noise ratio (SNR) of the source to relay link and I is the block length of the source codeword, then 
the first phase ends at the l-th channel use. During the first phase the relay node does not transmit. The second 
phase starts from the (I + l)-st channel use and lasts for the rest of the source transmission (i.e., it consists of I — I 
channel uses). In what follows, I is called the relay decision time as in |[T2l . During the second phase, the relay 
re-encodes the source message using an independent codebook and transmits it during the rest of the codeword. 
Note that the relay node can help by transmitting an independent copy of the message to the destination only if 
I < I. Otherwise, it does not participate in the cooperation and the channel behaves like a point-to-point (PtP) 
channel. Clearly, in this scheme, the source codeword X$ consists of two parts (X$ = [Xsi, Xs 2 ]), the first part 
(X S1 e C mx <' Ai )) is sent by the source while the relay is listening and the second part (X$2 £ C mX " C Ai )) is 
transmitted while the relay is transmitting its own codeword Xr 6 ^kx(l-(lAl)) ^ T nus the received signal at the 
relay and the destination in phase one can be written as 

Yxd = s/pHsdXsi + JVi, Yxd G C" x ^ Ai \ 
Y R = ^pH SR Xsi + N R , Y R g C fcx C" a; ) , 
and the received signal at the destination in phase two is given by 

Y 2D = ^H SD X S2 + ^Hr D Xr + N 2 , Y 2D G C" x(i - ([Ai)) , 
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where N± <G C" x(iAi) , N 2 G C nx(7 ~ (iAi)) and N R g C fcx(i " AZ) represent the additive noises at the destination during 
the first and second phases and at the relay, respectively. All the entries of N\, N 2 and Nr are assumed to be i.i.d. 
CAf(0, 1). Besides assuming channel state information at the receivers, we assume for simplicity, as does [8|, that 
the destination has perfect (genie-aided) knowledge of the relay decision time I which of course is a function of the 
source-to-relay channel Hs Jl . Further, to ensure that p represents the SNR of each link, we impose the following 
constraints on the covariance matrices of the inputs: 

E (Xs,rXl = I r ,ixm an d E [Xii^X^rA = Ifcxfc; ^T, (2) 

where XgT an d Xrt represent the T-th column of the source and the relay codeword, respectively. Let us also 
define the ratio min ^'^ by /, i.e., / represents the fraction of time for which the relay node listens before starting 
its own transmission, if it can decode the source transmission. In Section IIV1 we shall see that this parameter / 
plays an important role in the formulation of an appropriate outage event. Note that / is defined in terms of I which 
is a function of the rate of transmission R, and source-to-relay channel Hsr, making it a random variable. In what 
follows, we derive the dependence of / on the channel matrices more rigorously. From the definition of I specified 
earlier, we get 

IR 



1 = 



log(det(/ n + pHsr^Hsr)) 

lr\og{p) 
log(p)E'=i(l-7i)- 



[■•• R = r\og(p)} 

(3) 

where p~ li = w,-, 1 < i < minjfc, m} = t, v\ > v 2 > ■ ■ ■ > v t > are the ordered eigenvalues of the central 
Wishart matrix H^ sr Hsr and r is the multiplexing gain. Putting this into the definition of /, in the limit when 

I —> oo, we get 

Besides /, computation of the outage probability will also involve the joint distribution of 2 Wishart matrices 
mutually correlated in a special way. In the next section, we shall describe the structure of this correlation and 
compute the corresponding joint distribution. 

III. Joint Eigenvalue distribution of two correlated matrices 

In the DMT analysis we need only the asymptotic (in SNR) distribution of the eigenvalues of the matrices 
appearing in the outage formulation. In this section, we shall derive the joint distribution of the eigenvalues of 

1 The assumption of genie-aided relay decision time (and infinite codeword length) is relaxed and addressed rigorously for the single-antenna 
relay channel in the recent work of [12] where it is shown that there is no loss of diversity-multiplexing tradeoff optimality if the relay does 
not convey the side information about relay decision time but the decoder at the destination jointly decodes the decision time and the message. 

2 Note that vt > with probability 1 (w.p.l). 
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two such mutually correlated Wishart matrices. Mathematically, the asymptotic behavior of the eigenvalues of a 
random matrix is captured, following iflOl . as shown below. Letting the ordered eigenvalues of a matrix of interest 
be denoted by 7Ti > ir 2 > • • • > n u , the asymptotic nature of the eigenvalues is characterized by Si's, where 

tt 4 = p- s \ l<i<u. (5) 

Eventually, in this section we shall derive the joint distributions of these Si's; the following Theorem is the first 
step in that direction. 

Theorem 1: Let Hi <E <C N2xNl and H 2 G C N2XN3 be two mutually independent random matrices with i.i.d. 
CM (0,1) entries. Suppose that £1 > £2 > ■ ■ • £g > and Ai > A2 > ■ • • X p > are the ordered non-zero 
eigenvalues (w.p.l) of V\ = h\(In 2 + pH^H^j^Hx and Vi = H 2 H\, respectively, with p = mhi{A2, A^} and 
q = min{iVi, A^}, and where all the eigenvalues are assumed to vary exponentially with SNR in the sense of 
equation (0. Then, the conditional asymptotic probability density function (pdf) of the eigenvalues £ given A is 
given as 

q (p,q) v 1 (N 2 -j)AN 3 . _ P f. X \ 

hm^mf M ' 2j) ^) n (e-^m^+p^u n (-hr-)- 

3 = 1 ( u =l,v=l) i=l J = l t=l ^ 1 ' 

((u+v)=(N 2 +l)) 

Proof (Outline): Let the singular-value-decomposition (SVD) of V 2 be V 2 = WhU, where U € C N2xN ' 2 is 
a unitary matrix and A = diag([Ai, A2 • ■ ■ Ajv 2 ]), where Ai > A2 > ■ ■ ■ Xn 2 are the eigenvalues of V 2 (Note that 
(N 2 —p) of these are w.p.l). Denoting S = (In 2 an d H\ = UH\, V\ can be written as V\ = h\H\ where 

can be thought of as a channel matrix of N\ transmit antennas and N 2 receive antenna MIMO 
channel, where the channel is correlated at the receiver only, with the covariance matrix being S. The eigenvalue 
distribution of V\ for such a channel was derived in lfl8l for N 2 > N\ and in fl9l for N 2 < N\, respectively. 
However, the expressions for the corresponding distributions given in fTT~8l and |fl9l can not be used directly for the 
DMT calculation as they involve ratio of determinants whose components are hypergeometric functions. Fortunately, 
for high SNR values these expressions can be simplified. A detailed proof is given in Appendix lAl ■ 
Corollary 1: The joint pdf of £ and A is given as 

f(C,X)=f[((l + pXi)^e-^Xi^ +N ^) n ( e -^A u ) 

i=l (u=l,«=l) 

((u+t,)=(JV a +X)) 

q q (N 2 -j)AN 3 . - p tX \ 

3=1 3=1 »=i v py 7 

Proof of CorollaryUJ The joint distribution of the ordered eigenvalues of V 2 = H 2 H\ is given in |[T9l , which 
for asymptotically high p values becomes 

f2(A>n e - A *A 4 ^ + ^- 2i >. 

■i=l 

Using this marginal distribution of A along with the conditional distribution of Theorem Q] we get ©. ■ 
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Now, using the transformations Aj = p~ a% for 1 < i < p and £j = p~&' for 1 < j < q in equation © we get 
the following 

Theorem 2: If the non-zero ordered eigenvalues of V\ = H\{In 2 + p^H^) ~ 1 H\ and V2 = are denoted 

by £j = p~" j , 1 < j < q and A^ = p~ ai , 1 < i < p, respectively, where H± and H2 are as in Theorem Q] then 
the joint distribution of a and j3 is given by 

, if (aJ)eA; 

g(a,P)=S _ (7) 

0, if (a,/3) i A, 



and 



where A = Ua,/3) : {an + ft) > 1, V(z + j) > (7V 2 + 1); < ai < • • • < a p : < ft < • • • < 

B (a, ^) = ((^3 + ^2 - 2z + 1)04 - JVi(l - + ^2 - 2j + l)ft 

«=1 J=l 

9 (JV 2 -i)AJV 3 

+ E E (8) 

i=i i=i 

Proof: The proof is given in Appendix |B] ■ 
This joint pdf of {a, m will be used in the next section to compute the probability of an appropriately defined 
outage event. The scope of the asymptotic joint pdf derived in this section however is not restricted to the results 
derived in this paper. Although the correlation between the two Wishart matrices has a specific structure, it may 
arise in different communication problems. For example, a similar correlation structure is encountered in the outage 
analysis of the 2-user Z interference channel and the result of this paper was used to derive the DMT of that channel 
in lH. 

IV. DMT OF THE DDF PROTOCOL 

The optimal diversity order of a coding scheme, at a given multiplexing gain, is defined as the negative SNR 
exponent of the average codeword error probability averaged over the channel realizations. Thus to derive the DMT 
of a coding scheme it is important to first compute the average codeword error probability. In this section, we shall 
derive the best achievable diversity order of the DDF protocol in the following two steps: first, we shall show that 
the probability of error is exponentially equal to the probability of an appropriately defined outage event, O; and 
then we shall compute the negative SNR exponent of this outage probability, Pr(O). It is the second step, where 
we shall have to use the distribution result derived in the previous section. 

Let the average probability of codeword error of the DDF protocol, achievable over the MIMO relay channel at 
a given SNR, p and minimized over all possible coding schemes, be denoted by P^{p), i.e., 

P*{p) = min P£ (p) , (9) 

EK " {C(p)^( P )} E 

where P^ 1 ^ represents the probability of codeword error achievable by a particular coding scheme C(p) and 
^(p) represents the family of possible codes at SNR p. Then the optimal diversity order, denoted by d*(r), at a 
multiplexing gain of r is defined as 

«T(r) = lim ZMEIM. (10) 

p->oo log(p) 
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A. Probability of codeword error 

The computation of the average codeword error probability P E of the DDF protocol is divided in two parts 
depending on whether the relay node participates in the end-to-end communication or not. First, we consider the 
case where the relay node helps by cooperating (when / < 1) and then we consider the case where the relay node 
does not participate in the communication (when / = 1). We start with the first case. 

. Achievability (P|<Pr(0))B 
Let us assume that the source and relay use independent Gaussian codebooks and denote the conditional codeword 
error probability by Pe\h, where 

p E\n= E Pv(C g (p))Pv(Xs,Xs)Pt(Xs^Xs\Xs,X s ,H). (11) 

{Cg(p)} {x s .x s ec Gs ( P )} 

That is the error probability is computed conditioned on a channel realization, H and averaged over all ensembles 
of Gaussian codebooks having codeword length I and cardinality 2 Lrlog ( p \ This error probability can be upper 
bounded using Bayes' rule as, 

Pe\H= Pe,E^\H + PE.E r \H 

<PE\Ec,H + P Er \H, (12) 

where E r and represent the events of relay error and its complement. We know from equation (0 that the 
assumption / < 1 is equivalent to saying that the source-to-relay link is not in outage. On the other hand, on 
a delay limited point-to-point (PtP) fading channel the best achievable probability of error is essentially equal to 
the so called information outage probability. Therefore, since we assume sufficiently large block length for the 
codewords used by the source, it can be easily proved that Pe t \h < e > f° r anv e > and / < 1. 

Remark 1: The fact that Pe t \h < £ i f° r an y e > 0, / < 1 and I — > oo was proved in ifPTl (Lemma 1) for a 
relay channel with single antenna nodes. The proof for the M1MO case is identical. In what follows, we provide an 
outline of the proof. Suppose, a codeword is divided into TV segments of length L each, i.e. I = LN, where both 
L and N both grow to infinity and the ML decoder at the relay waits for N such segments to decode the message 
from the source, where N is given as 



N 



NR. 



(13) 



_I(Xs,t] Yr,t)_ 

where I(Xs,t', Yr,t) represents the mutual information between the source and relay node at time T. Note that this 
P\,Xs } t] Yr,t) is same for all T since the input and noise are identically distributed across time and the channel 
is fixed for the entire codeword. From equation (0 and the fact that / < 1 we get that NR < NI(Xs.t',Yr i t)- 
Now, Pe v \h as defined before represents the conditional probability of error on the PtP channel from the source 
to the relay node. Pe,.\h can b e upper bounded by replacing the ML decoder by a typical set decoder and then 

3 The outage event O will be denned shortly. 
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taking the average over the ensemble of codebooks. Then, following a method similar to that in in 12T1 (Theorem 
10.1.1) it can be shown that 

P Er \H < 2e + 2 3U 2- l ^ x ^ Yr ^- n ^ < 3e, (14) 
for sufficiently large L and any e > 0. 

Using this fact in equation ( fl2] i and averaging both sides with respect to channel coefficients, in the high SNR limit 
we get 

Pe < E H (P E 

By the preceding argument, after I channel uses the relay node can decode the source message, where I = fl < I. 
Suppose the relay node encodes the message into a codeword from its own codebook and starts transmitting it 
from the (I + l)-th symbol. Thus, for the first I channel uses the relay channel essentially behaves like an m x n 
point-to-point channel and for the rest (Z — I) channel uses it behaves like an (to + k) x n point-to-point channel. 
Since the source and the relay use independent random Gaussian codes, averaging over the ensemble of random 
Gaussian codes, it can be easily proved El that the pairwise error probability for a given channel realization, 
Ppe\e c ,h is upper bounded as follows: 

Ppe\E',h < det (/„ + ^-Hl D H SD ) 1 dot (/„ + JL(Hl D H SD + H^Bbd)) ' ' (16) 



Remark 2: The subtle difference between Ppe\e c ,h an d Pe\e c .h^ as defined in (fTTT i should be noted. In the 
former, the averaging within a particular codebook is not done. However, the two can be related through the well 
known union bound as follows 

Pe\E£,H < \C\PpE\Ee,H, 

where \C\ represents the cardinality of the codebooks. 

Recall that the cardinality of the codebooks were assumed to be 2 rlXo ^ = p w . Thus, using the union bound of 
probability of error in equation ( TToT l we get 



P E \Ei,H < p lT Pp E \ E c,H 



In + 7^ H SD H Sd] 



p r dot (/„ + £-(Hl D H SD + Hl D H RD ) 



-(1-/) 



(17) 



Now, if we define O in the following way, 



O = \ (l,H SD ,H RD ) :I(H) 4 / log (det(/„ + £-hI d H 8 d. 



2n 



+ (1 - /)log (det(I n + JL{Hl D H SD + H^Hrd))) < rlog(p) j, (18) 
then it is evident from equation (T7\ that 

Pe\0". E c 0j a s / — y oc. (19) 
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Finally, from equation ( fl5] l we have 

Pe < Pe\e°i 

= Pe\o;e° Pr(O c ) + Pe\o,e« Pr(O), 

< P £ | 0c . B cPr(O c )+Pr(0), 

< Pr(O), (20) 

where step (a) follows from Bayes' rule and in the last step we used equation dl9V Since Pe represents the average 
probability of error averaged over ensemble of codes, there exist a code for which d2Qb is true. Denoting the average 
probability of error for such a code by P e where the averaging is now over only the fading states, we have 

(«) 

P*E < Pe<¥r{0) (21) 

where step (a) in the above equation follows from the fact that P E represents the minimum probability of error 
among all possible coding schemes and in the preceding analysis we have only considered Gaussian codes. The 
above equation establishes an upper bound on P E . Next we derive a lower bound on P E . 

. Converse (P E >Pr(0)): 

Consider a genie aided relay channel where the genie gives the source message to the relay node after I channel 
uses. In the presence of such a genie the relay channel becomes a composite point-to-point channel, where for the 
latter (1 — /) fraction of the codeword, the relay and the source node together acts as the composite source. Clearly, 
1(H) represents the mutual information between the source and the destination node and consequently O represents 
the outage event of the genie aided composite point-to-point MIMO channel. Thus using Fano's inequality as in ifTol 
and the fact that the real system has a larger error probability than the genie-aided one, we get 

Pr(O) < P* (genie) = min P e (genie) < min P e = P%, (22) 

all coding schems all coding schems 

where P e and P e (genie) represent the probability of error of the actual and genie aided system for any particular 
coding scheme. Finally, combining (l22l and (I2TI 1 we get 

P E =Vv(0), for/<l. (23) 

Next we consider the case when / = 1. From the definition of / in equation (01 we know that when / = 1, the 
relay node does not take any part in the communication from the source to the destination. In this case, the relay 
channel becomes a point-to-point MIMO channel. It was shown in 1101 that P E of such a channel is exponentially 
equal to the corresponding outage probability. Putting / = 1 in our definition of O we get 

/=1 = [h SD : log (det(I„ + -^hI d H S d)) < rlog(p)} . (24) 

This is same as the outage event defined in iflOl for a point-to-point channel having channel matrix H$ d and thus 
using the result of IflOl we get 

P e =Pt(0) , for / = 1. (25) 
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Finally, combining the last equation with equation d23l we get the following Theorem. 

Theorem 3: The minimum (among all coding schemes) probability of codeword error, P| of the DDF protocol 
is exponentially equal to the probability of the event O defined in (fT8l . i.e. 

P*e= Pr(0) (26) 

B. SNR exponent o/Pr(C) 

In what follows we shall refer to O as the outage event and Pr(O) as the outage probability. By definition ([Tol l 
and Theorem [3] it is clear that the the negative SNR exponent of the outage probability is equal to the optimal 
diversity order of the DDF protocol, i.e., 

-log(Pr(Q)) 

d (r) = hm — — (27) 

p->oo lOg(p) 

For asymptotically high value of p, 1(H) can be written as 

I(H)= / log (dot (l n + pHsdHIbJ) + (1 - /) log (dot (l n + p(H SD Hl D + H RD H RD ) 
= log (det(J„ + pH 8D Hl D j) + (1 - /) log (det(/„ + P H RD H RD (I n + pHsoH^)- 1 
= log (dct(/„ + pH SD Hl D )) + (1 - /) log (det(/ ft + P H RD (I n + P H SD Hl D )- l H RD 

Note that in the above expression / depends on 7 through equation ®. The distribution of 7 for asymptotic p is 
given by iflOl 

h( 7 )=<^ (28) 

{ 0, if 7 i v, 

where T> = {0 < 71 < 72 < • • • 7*}. Putting H2 = Hsd and Hi = H RR) in Theorem|2] the above expression can 
be written as 



where the joint pdf of a and /? is given by equation (0. Substituting this equivalent expression for 1(H) into the 
definition of the outage event we see that the outage probability Pr(O) depends on the different channel matrices 
only through the joint distribution of a, f3 and 7. Further, since 7 is independent of (a, /3), the outage probability 
can be written as 

Pr(G)= / g(aj)h(7)dad^d 7 , (30) 

4 Because 7 is a function of Hsu whereas (a, /3) is a function of Hrd and only, and does not depend on H^r 
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where O is given, using ( fT8] l and ( |29l i, as 



O 



3=1 



=/; 



(3D 



< min ( 1, — r 

" I E?=i(i-tO 



(32) 



Finally, evaluating the integral in equation ( f30b we get the following theorem. 

Theorem 4: The optimal diversity order, d*(r), of the DDF protocol at any multiplexing gain r is given by 



d*(r) = min j fd(r) + p(r, i)) , ui m ,nO") + d fc>m (r)) | , for < r < min{m, n}, 



(33) 



where the ip(-, •) function is defined as in (fTJ, d m n (r) represents the diversity order of a MIMO PtP channel at a 
multiplexing gain of r iflOl and 

d(r) = min 



Si GO 



l<i<3 {jeR,, 6eB«(j/)} 

y(n - r ) 



F[4> a [r-b[l-~) ) ,Mb),^(y) 



0, 



= r, 



r 



(n — ?*) 



; B 2 {y) = [0,g]; B 3 fo) 



ft 2 = 



(n — r) ' (q — r) 



' (y-r) 
K 3 = 



qr 



(q-r) 



■t 



with 



(34) 



i=l 

and the vectors Q (-), 0,3 (•) and 7 ( ) defined in Appendix C in equations d66l ). ( f6Tb and d68l ). respectively. 
Proof (Outline): The probability of outage can be expressed as 

Pr (0) = Pr (0\f < 1) Pr (/ < 1) + Pr [0\f = 1) Pr (/ = 1) . 

Using the result from ifTUl in equation (124-b we get 



Pr(0|/ = l)=pJ]T(l-« l ) + < 



-d m , n (r) 



From the definition of / in equation (O and equation d28l i. we get 



Pr (/ = 1) = Pr - 7i )+ < r j =p~ d ^ . 

Now, using the fact that / £ [0, 1] with equation ( |37| ) we get 

Pr(/ < 1) = 1-Pr(/ = l)=l- /0 - d -.'=('-)=p-v('-.t) ) 



(35) 



(36) 



(37) 



(38) 



because for r > t, d m ,k(r) = 0. Finally, denoting the negative SNR exponent of Pr(C|/ < 1) by d(r), i.e., 
Pr (0\f < 1) =/9~ d(r) , and combining it with equations ([35]) , (O, ([37]) and (O we get 

. - U(r)+v(r,i) ) -(<U,»{r)+i*,m(r)J 
Pr(C)=p V / + p \ ) , 
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which imply 

d*(r) = min { (d(r) + tp(r, «)) , (d m , n (r) + d fc , m (r)) } . 

To complete the proof it is only necessary to compute d(r), for which we need to evaluate the integral in equation 
(f3Qb under the constraint / < 1. This integral can be evaluated using Laplace's method of integration as in iflOl to 
get d(r) as the minimum value of the negative SNR exponent of the pdf of (a, f3, 7), minimized over the intersection 
of the outage set and the support set of the pdf. Evaluating this minimum value directly is not prescribed for two 
reasons: 1) it is not a standard convex optimization problem; and 2) the number of optimizing variables increase 
linearly with the number of antennas at all the nodes (i.e., with (m + k + n)). To overcome these problems we first 
transform the original minimization problem into an equivalent optimization problem having only three variables 
and then, analyzing it further, we eventually get the much simpler convex optimization problem given in the theorem 
statement involving only two variables. 

This is done in Appendix Icl ■ 
Note that the main step in computing the DMT of the DDF protocol for any given antenna configuration is the 
computation of d(r). We illustrate this step by an example. 

Example 1 (DMT on the (1, 1, 1) channel): Putting n = m = k = lin the expression for F we get 

G(a, b,y)=F (0 a (o), ^(6), c/> 7 {y)) = 2(1 - r) + 2b (l - - b + (1 - y). 

From Theorem |4] we know that the above objective function has to be minimized over three different set of (a, b) 
pairs depending on the value of y. Since 

r\ 1 

yj 2 

the objective function attains its minimum at the maximum value of b in B{y), i.e., b* = y (— -J. Putting this into 
the objective function we have 

G(a\b%y)=y(^^+(l-y) = l+y(^^y (39) 
It is clear that the optimal value of y e 1Z\ = [r, y-^ that minimizes the above function is given as 

r for r < 5 ; 
1, forr>i. 
Putting this solution in equation (f39b we get 

f 2(1 -r), for0<r < i; 

d(n 1 ) = { h " " 2 (40) 

Since for m = n = k, H-2 = next we consider the case when y e IZ3 = ^j-^, 1 ■ This set is non-empty 
only for r < 4 and the optimal point lies on the line segment DF in Figure |8(c)| Dividing the set 7^3 further into 
two subsets IZ31 — (r, 2r] and 7^.32 = (2r, 1] we see that (1 — - J < | when y e 72.31 and ( 1 — - J > | when 
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G{a\b\y) 



b * = _yr_ 

y-r 



y € 1Zz2- The objective function attains its minimum value at point F, where b is maximum when y € IZ31 and at 
point D, where b is minimum when y € 7^.32 and given by 

for ye K31; 
2(l-r) + (l-y), for for ye R 32 , [b* = 0] 
Now, optimizing this function in the corresponding sets of y we get 

3-4r, for y€ K 31 ; [v y* = 2r] 
2(1 -r), for for y e ft 3 i; [•.• t/* = 1] 



G(a*,&* ;2/ *) 
which in turn implies 



d(7e 3 ) = 2(1 -r), for < r < -. (41) 
Combining equations (|40T > and PTt we get 

2(1 -r), for0<r<±; 



d(r) = mm{d(llo),d{K 1 ),d(K 3 )} 



I 1 ?), fori<r<l. 



Using this in Theorem [4] we see that the optimal diversity order of the DDF protocol on a (1, 1, 1) channel is given 
by d(r), thereby recovering the result of J8J . 

V. Closed form expressions for the DMT of a few simple relay channels 

In this section, we shall provide closed form expressions for the DMT of the DDF protocol for three more general 
channel configurations (than the previous example), namely, the (n, 1, n) channel, the (1, k, 1) channel (for k > 1) 
and the (2, k, 2) channel (for k > 2) by solving the optimization problem in Theorem [4] 

A. DMT of the (n,l,n) channel 

Theorem 5: The optimal DMT of the DDF protocol on a (n, l,n) half-duplex relay channel for multiplexing 

gains r is 

< r < 1 and n = 1; 



max{ ^ ,r} ' 

d *i( r ) = \ (n-l) 2 + (3?7. -l)(l-r), 0<r<landn>2; (42) 
dn,n(f): 1 < r < n and n > 2. 

Proof: The proof is given in Appendix [D] ■ 
Remark 1: The optimal DMT of the full-duplex decode- and-forward (FD-DF) protocol was derived in |[TTll and 
over a (n, l,n) relay channel it is given as 

min{d(„ + i) jn (r),d„ jn (r) +d„,i(r)},0 < r < 1; 
d„, n (r), 1 < r < n, 

where d„ tm (r) represents the diversity of a n-transmit, rn-receive antenna MIMO channels diversity order at 
multiplexing gain of r. For n > 2, the DMT given by the above equation is identical with that given in Theorem 
Thus, when the relay has a single antenna and the source and destination have n antennas each, the optimal DMT 
of the half-duplex DDF protocol and a FD-DF protocol are identical. 



d DF (r) 



August 10, 2010 



DRAFT 



SUBMITTED, IEEE TRANS. INFORM. TH., AUGUST 2010 



16 



B. DMT of the (l,fc, 1) channel 

Theorem 6: The optimal DMT of the DDF protocol on a (1, k, 1) half-duplex relay channel is 

f (fc + l)(l-r), 0<r<^; 
=1 1 + ^r<r<i; (43) 

Proof: The proof is given in Appendix [E] ■ 
Remark 2: For fc > 2, the optimal DMT of the SCF protocol on a (1, k, 1) channel is given by lfl6l as the 
piece-wise linear curve whose values at three corner points are given as cIscf(0) = (1 + k), dscF{\) = 1 and 
^SCf(I) = 0. Comparing this with the corresponding DMT of the DDF protocol given above we see for < r < i, 
the DDF protocol can achieve better diversity order (see Figure [2]i while requiring less channel state information. 
Moreover, since the SCF protocol achieves the best DMT among all static protocols, it is evident (for example, 
see Figure O that the DDF protocol can perform better than the SCF protocol, that a static protocol is not DMT 
optimal on a MIMO HD relay channel. 




0.2 0.4 0.6 0.8 1 

Multiplexing Gain, r 



Fig. 2. DMT comparison of the DDF and SCF protocol on a (1, 2, 1) relay channel. 

Remark 3: Recently, the fundamental DMT of the (l,k, 1) relay channel was derived by the authors in ll23l 
which is given as 

( (fc + l)(l-r), O^r^^; 
2 (1 - r) , | < r < 1. 

Comparing it with the result of Theorem [6] we see that the DDF protocol can achieve the fundamental DMT of 
the channel for multiplexing gains in the range < r < |. However, DMT optimality of the DDF protocol is not 
restricted to just this channel. In the next section we shall see that the DDF protocol can achieve the fundamental 
DMT of the channel for other antenna configurations also for some multiplexing gains. 
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Remark 4: Note if Theorem [5] and Theorem [6] are specialized to the cases of n = 1 and k = 1, respectively, 
one recovers the result derived in (8). 



C. DMT of the (2,fc,2) channel 

Theorem 7: An upper bound to the optimal DMT of the DDF protocol on a (2, k, 2) relay channel is given by 
the following 

<Mr) + da )Jk (r), < ?~ < 2; 
* + 3 + (* + l)(^E£), 0<r<§; 



' l-r 



mm < 



fc + 6 v 
■4(jfe-l 



< r < 1; 



1- r 

2— r 



i<r<l; 
l<r<|; 



l<r<|; 
|<r<2; 



2(V) 

Proof: The proof is given in Appendix [F| ■ 
Remark 3: The DMT of the DDF protocol on (2, k, 2) channel was also computed using the numerical method 
(described later in this section) for the general (m, k, n) channel. It was observed that the DMT coincides with the 
upper bound given by the above theorem for k < 20. Thus the upper bound of Theorem [7] is tight for k < 20. 
Given the tightness of the bound for k < 20 we conjecture is that the upper bound of Theorem [7] is tight for all 
k > 2. 




(a) (b) 



Fig. 3. Cooperative networks: (a) CNi: A mobile station act as a relay node; (b) CN2: A dedicated relay station acts as the relay node. 

VI. Explicit DMTs of the DDF protocol and a comparative study in practical applications 

In this subsection, we illustrate the advantage of the DDF protocol over PtP communication schemes and provide 
a comparative analysis of its performance with respect to other MIMO cooperative schemes. We consider different 
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DDF protocol 

Opt. DMT on the PTP Ch. 




(2,2,4) relay channel 
(1,1,2) relay channel 



Multiplexing Gain, r 
(a) Uplink Channel 



O 6 



> 4 
Q 



- DDF protocol 
■Opt.DMTonthe PTP Ch. 



1,2,2) relay channel 
(2,1 ,1) relay channel 



Multiplexing Gain, r 
(b) Downlink Channel 



Fig. 4. DMT curves of the DDF protocol on relay channels of CNi. 



DDF protocol 

Opt. DMT on the PTP Ch. 



(2,3,4) relay channel 




(1 ,2,3) relay channel 



0.5 1 1.5 2 

Multiplexing Gain, r 




DDF protocol 

Opt. DMT on the PTP Ch. 



(4,3,2) relay channel 
(3,2,1) relay channel 



(a) Uplink Channel 



0.5 1 1.5 2 

Multiplexing Gain, r 



(b) Downlink Channel 



Fig. 5. DMT curves of the DDF protocol on relay channels of CN2. 



practical scenarios where cooperative communication promises potential gain in overall system performance because 
of which it is being considered by several standardization bodies. Figure [3^ depicts a cellular network wherein, a 
mobile user (or mobile set (MS)) uses another mobile user as the relay station (RS) to communicate its message to 
and from the base station (BS). This cooperative model was first proposed in Q. Figures |4(a)| and |4(b)| represent 
the uplink and down-link performances, respectively, of the DDF protocol in such an environment with respect to 
the fundamental DMT of the corresponding m-transmit, n-receive antennas PtP channel. 

Figure [3J5 depicts a scenario where, in a cellular network (CN), a particular cell area is divided into more than 
one sub-cell and each sub-cell is served by an additional dedicated node (a smaller BS) to provide better quality of 
service. Thus each user in these sub-cells can use these dedicated nodes to relay their messages to and from the BS. 
This is different from the previous cooperative scenario in the sense that this relay stations can host more number 
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of antennas than a mobile set. This configuration is under consideration to be implemented in LTE-advanced and 
WiMAX technologies l24l and being standardized by the IEEE 802.16s relay task group 0"). Figures [5(a)| and [5(b)| 
show the uplink and downlink performances respectively, of the DDF protocol in such a scenario. These figures 
clearly demonstrate the superior performance of the DDF protocol over that of the corresponding MIMO channel. 

Note the asymmetry in terms of number of antennas at different nodes in both of the above applications, which 
points out the importance of analyzing MIMO relay channels with an arbitrary number of antennas at each node. 




O 6 





HD-DDF Protocol 


_ \V 


-x-HD-SCF Protocol 


\ \: 






^(2,4,2) relay channel 


: s X \ ^ : 


^{1 ,2,1) relay channel 

\N 
X V 
\ K 
X N 
X N 

X N 

' " "^-sc 



0.5 1 1.5 2 

Multiplexing Gain ( r ) 



(a) CN3: Sensory network with a mobile relay station (MRS) 



(b) DMT comparison of DDF and SCF protocol 



Fig. 6. Explicit DMTs of the DDF and SCF protocols on relay channels of CN3. 



Among the many cooperative protocols available in the literature, the static CF and full-duplex DF are the two 
protocols that are known to have high performance and whose explicit DMT characterizations on a MIMO relay 
channel are now known. The DMTs of the DDF and the SCF protocols on the sensor network of Figure |6(a)| first 
proposed in 11251 where a more capable mobile relay station (with more antennas) helps several less capable sensor 
nodes to communicate with each other, is plotted in Figure [6(b)] This figure illustrates that on such a relay channel, 
the DDF protocol can outperform the SCF protocol at low multiplexing gains. However, on a relay channel of some 
other CN, such as the downlink of CNi, the SCF protocol performs marginally better than or identical to that of 
the DDF protocol uniformly at all multiplexing gains, as shown in Figure |7(a)| 

The implementation of a protocol in a practical application however, depends on a number of other issues, with an 
important one being the channel state information (CSI) required at different nodes. The SCF protocol, in contrast to 
the DDF protocol, requires global Cs|f| at the relay node, which in some application-such as the downlink channel 
of CNi- may be a challenging or even impossible task. The CSI assumptions for the DDF protocol are that the 
receivers know the incoming channels which is a much milder assumption given this can be accomplished via 
training. It is also assumed in this work that the destination has perfect knowledge of the relay decision time which 
is a function of the source-relay channel. Future work on the DDF protocol for MIMO relay channels in the spirit of 

5 The knowledge of all the instantaneous channel matrices of the relay channel. 
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bringing theory closer to practice would be to relax the assumptions of infinite block length and genie-aided relay 
decision time information at the destination as was done for the single-antenna relay channel in lfl2l . for which the 
results of this paper would serve as a benchmark. The DDF protocol would thus provide a practical alternative to 
the SCF protocol without sacrificing much by way of performance. For instance, on the (4, 2, 2) relay channel of 
Figure |7(a)| the burden of providing global CSI to the relay node which is an MS having limited capability can be 
avoided through negligible loss in diversity order. 

The performance comparison of the DDF protocol with that of the SCF protocol depicted in Figure |6(b)| is 
also interesting in light of a recent result |26l , where it was proved that, on a single-antenna relay channel, a 
static protocol, namely the quantize-and-map protocol, is DMT optimal among all static and dynamic cooperative 
protocols. This result l26l raises a natural question: can a static protocol achieve the fundamental DMT of a 
MIMO half-duplex relay channel? The analysis of this paper shows that the DDF protocol can be better than the 
theoretical limit of static protocols, which is the DMT of the SCF protocol, and thus answers the above question 
in the negative. The comparison in Figure [6(b)| proves that static protocols fundamentally can not fully exploit the 
resources available on a HD MIMO relay channel. 

In Figure |7(b)| we compare the performance of the DDF protocol with the full-duplex DF protocol on the uplink 
channel of Figure EJa). This figure illustrates that the dynamic operation of the half-duplex relay node in the DDF 
protocol can help achieve almost the same or equal performance as in the FD-DF protocol without full-duplex 
relaying. While the large difference between the transmitted and received power levels makes full-duplex operation 
impractical, if not entirely infeasible, the DDF protocol may be implemented with little or no loss in performance 
while avoiding the cost of full-duplex operation. 





DDF Protocol 

-X- SCF Protocol 


(4,2,2) relay channel 


(2,1 ,1) relay channel 





Multiplexing Gain, r 
(a) Downlink Channel 




□- Full-duplex DF Protocol 
DDF Protocol 



(2,2,4) relay channel 
(1,1,2) relay channel 



Multiplexing Gain, r 
(b) Uplink Channel 



Fig. 7. DMT comparison among the DDF, SCF and the FD-DF protocols on relay channels of CNi . 
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VII. Conclusion 

The asymptotic joint eigenvalue distribution of two specially correlated random Wishart matrices was derived, and 
using this result, the optimal diversity-multiplexing tradeoff was obtained of a three node half-duplex MIMO relay 
network, where each node has an arbitrary number of antennas, operating in the dynamic decode-and-forward 
protocol. For several specific channel configurations we computed explicit DMT curves for the protocol and 
compared it with FD-DF and HD-SCF protocols. These comparisons reveal some interesting facts such as, for 
some channel configurations, the optimal DMT of HD-DDF and FD-DF protocols are identical while for other 
channel configurations the diversity orders of the HD-DDF protocol are marginally less than those over FD-DF 
protocol at high multiplexing gains. Further, the comparison with the HD-SCF protocol shows that for some channel 
configurations, at low multiplexing gains, the optimal diversity orders of the HD-DDF protocol are greater than 
those of the corresponding DMT of HD-SCF protocol, which should motivate one to further investigate other 
dynamic protocols such as the dynamic compress-and-forward protocol, on a three node relay channel. This work 
also motivates further research on the generalization of the single-antenna relay channel results in IPT21 to the MIMO 
relay channel by considering finite lengths codes and doing away with the assumption of genie-aided information 
about the relay decision time at the destination. Extending the present analysis for a relay network having multiple 
relay nodes is another topic for future research. 

Appendix A 
Proof of TheoremQ] 

We begin by proving the theorem for N2 < A3 and later extend the proof for N2 > A3. Let Ai > A2 > • • • > 
Xn 2 > represent the ordered non-zercj^ eigenvalues of V? = H^H^. The spectral decomposition of V2 can be 
written as V% — UAU\ where A = diag(A) and U 6 t£N 2 xN 2 j s an uni^y matrix containing the eigenvectors of 
V 2 . We can now write V x = H\t,Hi, where H l = WH ± and £ = (I Na + pA^ 1 . 

Remark 4: Note that since U, A (e.g., see Lemma 2.6 in ||27i ) and Hi are mutually independent so are A and 
Hi. Also, since Hi is unitarily invariant, Hi and Hi are identically distributed. 

Before proceeding further let us recall a result derived in |[T8l which deals with the eigenvalue distribution of a 
random matrix of the similar form. 

Lemma 1 ( lfl8l ): Let x% > X2 > ■ ■ ■ > xm be the ordered non-zero eigenvalues of Z, where Z = XLX^ and 
X € C MxN ,M < N, has mutually independent complex Gaussian vectors as its columns with zero mean and 
covariance Im, and yi > 1/2 > ■ ■ ■ > un be the ordered non-zero eigenvalues of L, then the density function of Z 
is given by 

|Vi(x)||Vi(y)| 

6 The eigenvalues of V2 are distinct and non-zero with probability 1. 
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where Vi(x) and Vi(y) are Vandermonde matrices formed by the vectors x and y, respectively and 

1 VN 



Ai (y,x) 



N-M-l 
UN 



N — M—l — 

Vn e VN 



N-M-l N-M-l 
1 V(N-l) ■■■y {N - 1) V(N-l) 



V(N-l) 



...yN-U-1 



N-M-l - 

Vi e 



y N N - M - x e-% 



(45) 



1 Vi 

Note that Z in the above lemma has the same structure as V\ but is only valid for M < N. Thus assuming N-y < N 2 
we can substitute X = H\ and L = S in Lemma Q] to obtain 

Ai(a,e) 



f(V 1 \a) = C 1 



(46) 



|Vi(OI|Vi(a)|' 

where C\ is a constant independent of all the eigenvalues and a is the vector of the ordered eigenvalues of S, i.e., 

ai = (l + pX(N 2 -i+i)) 1 , for 1 < i < N 2 . 

The conditioning on a in equation (l46"T l is present due to the randomness of £. However, we are interested in the 
distribution of the eigenvalues of V\ and not Vi itself. 

Let us assume that the spectral decomposition of V\ is given as V\ = VUV^, where V £ C NlXjVl is an unitary 
matrix and II = diag(xi, ■ • • xn-l) is the diagonal matrix containing the ordered eigenvalues of V\. It is well known 
that the Jacobian of the transformation Vi i— > (II, V) is given by 



JVi 



j(n,y) = |v 1 (OI 2 =nte-^) 2 



(47) 



i<j 



Using this expression for the Jacobian and equation d46b we can find the conditional joint pdf of (II, V) conditioned 
on a. Then integrating the resulting pdf over the space of unitary matrices we get the joint distribution of the 
eigenvalues as 



fa(f)*) = C 2 |Vi(D| 



2 M*>3 



(48) 



|Vi(OI|Vx(^)| 

Evaluating this expression in general is complicated due to the term Ai (a, £) in the above expression. The following 
Lemma helps us to evaluate it in the high SNR regime. 

Lemma 2: If the eigenvalues {£1, • • • £ni} and {Ai, • • • Xn 2 } var Y exponentially with p, then for asymptotically 
large p the following identity holds 



Ji^ 



Ai(a,£-) 
IVi^HVx^)! f\ 



Ni 



n n 

j=i i=i 



Also, since |Vi(£)| = rii<j(?i — 0) EB' f° r asymptotic high p, using the ordering among the £j's and equation 
©, we have |Vi(0|=n]Ii ^ N '~ i \ Putting these simplified expressions for Vi(£) and J\ in equation d48l > we 
get 



JY 2 



-? J p-(P« J A JV2 + 1 _ J )^(JV 2 +Ar 1 -2j) 



n n 

j=i i=i 



p&Xi 



(49) 



Recall that to use Lemma Q] we had to assume Ni < N 2 ; in what follows, we consider the case when Ni > N 2 . 



Denoting H x = £t#i, Vi = H[HHi can be alternatively written as Vi = H\ H\. Also, the eigenvalues of Vi 
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and Hi&i are the same for each realization of Hi. Since we are only interested in the eigenvalues of V\, it is 
sufficient to compute the joint pdf of the eigenvalues of Wi = H\H\ . Now, Hi G C N2xNl can be thought as 
a channel matrix, semi-correlated at the receiver, of an A^-receive and iVi-transmit antenna MIMO channel. The 
conditional eigen- value distribution of W\ given the eigenvalues- a = [ci, (T2, • • ■ <jn 2 }- of the correlation matrix, 
was found in |fl9l for N2 < Ni and is given by the following 

|Vi(OI|V 2 (a-)| 

where £1 > £2 > • • • > 6v 2 > are the ordered non-zero (w.p.l) eigenvalues of both H\H\ and V\,o\ > 02 • • • > 
crjv 2 > are the ordered eigenvalues of £, C is a constant independent of all the eigenvalues and 

r t .-\N 2 ,N 2 
J i,j=l 

V 2 ( < 7)=V 1 (-[ ( xr 1 ,a 2 - 1 ,... ( T W2 1 ]). (51) 

In what follows, we will simplify the expression given in equation ( f50b for asymptotic high values of SNR (p), 
assuming that the eigenvalues {£1, • • ■ £,n 2 } ar, d {Ai, ■ ■ ■ Aat 2 } vary exponentially with p. For asymptotically high 
p, using the ordering among the £j's, A^s and equation (0, it can be easily shown that |Vi(£)| = Ili^i Ci^ 2 an d 
l v 2(CT)|=IliI 2 i(Mi) (iV2_i) . Finally, the term D(£,a) in equation (g0) can also be simplified using the following 
Lemma. 

Lemma 3: If the eigenvalues {£1 > • • • > £n 2 } an d {Ai > ••• > Ajv 2 } vary exponentially with p, then for 
asymptotic high SNR we have the following identity 

£ N 2 N 2 (N a -j) 

^>)^i[e--K 2 i=n( e ^ e ^ AN2+i - j) )n n i 1 

j=i j=i i=i 

Putting these simplified expressions for |Vi(£)|, | V2 (^") | a nd D(£,a) in equation < f50b we get 



■ N 2 \ / N 2 



Comparing equations d49l i and d52b . the expression for the distribution for N2 < N3, can be written as 

N 2 q q (N 2 -j) , -(p£-\-)\ 

f 2 ^7)=n( i +^)" i n( e - ?3e " (p?3AN2+i -^r 2+JVi " j) )n n ( V - )• ^ 
i=i j=i j=i *=i ^ p ^ Ai ^ 

Recall that, we assumed that N2 < N3 in the foregoing analysis. This assumption was important since it implies 
that all the eigenvalues of V2 are non-zero and distinct. However, if N2 > N3, then exactly (N2 — N3) eigenvalues 
of V2 are zero and we need to consider this case separately. Because if two or more A^s are zero then both the 
numerator and denominator of the terms J\ and J 2 = of equations d48l > and ( |50l l, respectively become zero. 

On the other hand, if only one of the A^s is zero, we can no longer assume that it varies with SNR. In either case 
the foregoing analysis cannot be pursued when N2 > N3. However, both these problems can be overcome by doing 
the analysis in the limit when these (N2 — N3) eigenvalues tend to zero. It can be easily shown (e.g., see Lemma 6 
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in ||29l ) that both J\ and J2 are well-defined in the limit when (N2 — N3) smallest eigenvalues of V2 tend to zero, 
i.e., 

lim J,; = lim Jj (54) 

is well defined, for both i = 1,2. Using this fact and following a similar approach as before we get 



N 2 



f 2 (£|A)= 



9 <? (N2-3) /1 -(p£ A ) 

!™ +p x ^ ni n (e-^e-^ A ^+ i -^^f a+jvi - !w) ) n n 



=na + a)^ 1 n (e-€*e-«* w-)^ »+^) n n ( i__±a — j 

2—1 j — 1 j=l i=l 



where 



if (JV 2 + l-i)<JV 3 ; 



0, Otherwise. 
Combining equations (IB3T l and (IB3T l Theorem Q] is proved. 



Appendix B 
Proof of Theorem[2] 

The ordering among the o^'s and /3j's in A follows from the ordering of the eigenvalues Aj's and ^ 's, respectively. 
Besides the ordering, if (a, /?) £ A then one or more of the following are true 



e -p^j =e -P 



(l-oi-/3j) 



0, if (ati + Pj) < 1; 
--e~ p ~ ai = Oif a x < 0; 
e^ 1 =e- p ~ C>1 = if /?i < 0; 



Since each of the terms on the left hand side of the above equations is a multiplying factor to the asymptotic 
expression of the joint pdf ([6}, it becomes zero if (a, /3) ^ A. On the other hand if (a, /3) € .A, then 

(l + pA i )=p-( 1 - «) + ) 



e" p '=1, Vi, 

'=1, Vj, 
n (i-«i-Pji . 



"=1, V(i + i) = (JV 2 + l). 

Using this along with the Jacobians of the transformations A, = p~ Qi , 1 < i < p and = p~P j , 1 < 7 < g which 
are given by J(aj) = log(ai)p _ai and J(/3j) = \og{^j)p~^ , respectively, into equation (|6]l we get 



{(aj)=p 



( \ a (Ni — ilAJVa 

-( E? = i((A r 3 + A r 2-2i+l)a I -Ari(l-a ! ) + ) + E] =1 (A r i + A r 2-2j + l)/3 j ) A TT / 1 - e _p " 



n n 

j=i i=\ 



(56) 



Also, at high SNR limit for any z, j 

/ 1 - a > - 8 \ 



1, if (oi+jSi) > 1; 

p-(i-a,-fr) jif (oj + jS,-) < 1. 



1 - e~ z 
lim = 1 

z-»0 z 
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Now, using the fact that the product of several converging sequences converges to the product of their individual 
limiting values we get 

q (N 2 -j)AN 3 / _l-a i -p j \ q (N 2 -j)AN 3 

n n ; .. . n n P -^-^ + - m 
j=i 1=1 \ * j j=i i=i 

Finally, putting this in equation d56b we get Theorem [2] 

Appendix C 
Proof of theorem|4] 

As discussed before, here we shall compute the SNR exponent of Pr(0|/ < 1). The proof of this part is 
rather long and hence divided into three steps. We start with an outline of the different parts. In the first part a 
straightforward analysis of the outage probability following a similar method as in ifTol yields d(r) as the minimum 
value of the negative SNR exponent of the corresponding pdf, where the minimization is over the intersection of 
the outage set and the support set of the pdf. In the next step, this problem is then transformed into an equivalent 
one having smaller number of variables which is solved in the final and third step. 

Step 1: Using Laplace's method of integration as in iflOl we get from equation (130b 

<f(r)= min ( E (a, /3) + V (k + m - 2i + ) = min F (aj,j) , (58) 
{(a,/3,7)eons} V ~1 J {(aJ,T)eon4 V 1 

where S = A PI T> represents the support set of the pdf of (a, f3, 7). 

Suppose at a given r, the objective function attains the minimum value for an a € A PI T> where on > 1 for one 

or more i's. Let a = min{[l, 1, ■ ■ • ,1], a}, where the minimization is componentwise. Clearly, a £ ADV but at 

this point E has a strictly smaller value. This proves that in the optimal solution, 014 £ [0, 1] for all i. The same is 

true for f3 and 7. Thus d(r) is given as 



p q 9)( n— j)Am 

d(r) = min _ + m + k - 2i + l)cti - kp+ + k - 2 j + l)/3j + ^ (!-«»- 

3=1 3,i=l 

+ (fc + «^-2^ + l)7 ^ , (59) 



{(a,0,7)ed} l=1 



where 



6=1 (a,p,i) = £(1 - «i) + (!-/)£(!- ft) <r, (60) 

I i=l 3=1 

0</ = -^ -<1, (61) 

(CH+Pj) >l,V(i,i) : (i + j) > + (62) 

0< ai < ■■■ < a p <1, (63) 

0<Pi < ■■■ <0 g <l, (64) 

0<7i<---<7*<lL (65) 
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where equation d6"Tl ) follows from equation (O and the fact that / < 1. Note that the number of optimization 
variables increases linearly with the number of antennas at all the nodes. To overcome this problem, in what 
follows, we transform the previous optimization problem into an equivalent one having a fixed number of variables 
which is independent of the number of antennas at the nodes. 

Step 2: The objective function in (|59l decreases strictly monotonically as a, is decreased for any i and the rate of 
decrease with <x; is smaller for a larger value of i. The same is true for /3 and 7. Thus, following a similar method 
as in [10 , it can be shown that if 5^f =1 (l — oti) = a, JJJ— i(l — Pj) = b, Y^i=i(^ ~ li) = V an d At) satisfy 
equations d63l)-([65l), then the optimal choice of (a, 0, 7) that minimizes F(.) is given by (4> a (a),(j)p(b),(j) 1 (y)), 
where 

4> a {a) = a 2 , • • • , a p ] T : & t = (l - (a - i + ,l<i<p. (66) 

M b ) = J q ] T ■ fc = + i<i<g; (67) 

Mv) = [7i,72,- ■■ ,7t] T : 7i= + 1 < I < t. (68) 

Denoting by T(a, b, y) the following set 

_ p 1 1 _ 

|(a,/3,7) :^(1 - a,) = a,^(l -ft) = k,^ 1 ~ ^) = 2/5 ("'ft 7) satisf y equations d63)-(|65j j , 

i=l j=l i=l 

from the foregoing argument we have 

min F ((a, (3,j)) = F (Ma),Mb),My)) ■ (69) 

{T(a,b,y}} 

Let us now define the following set of new variables 



Oi = Ua,b,y) : a + b I 1- -j < r, (a + b) <n, 0<b<q, r <y <tj . (70) 
It is clear from the definition of T(a, b, y) that, 

6i± (J T(a,b,y)DO. (71) 

{(a,6,y)eOi} 

Since the minimum of a function over a set is not larger than the minimum of that function over a subset of it, the 
above relation along with equation d69l imply 

min F(</> a (a),<f>p(b),(f>y(y)) = min F (a,f},i) < min F(a,/3,*f). (72) 

{(a,6,y)GOi} {(fi,/3,7)e<f>i} {(a,0,j)£O} 

Before proceeding further we take note of a few properties of the newly defined variables a, b and y. From the 
definition of fa's it is clear that if (a,b,y) G 0\, then (<fi a {a), 4>p{b), 4''y{y)) satisfy equations d60b , doTl ) and ( l63l >- 
d65l l. Suppose for some (i+j) = (n+1), (dj+ft) < 1, then it can be shown that <^)+Sj=i (1 — ft) > n ' 

which is impossible. Thus (dj + ft) > 1 for all (i + j) > 1, which in turn imply that the (</> a (a), (j)p(b),<t) 1 (y)) 
tuple also satisfies equation d62b . That is 

(a,b,y)e0 1 ((f) a (a),(f>/3(b),(f)-y(y)) G 0, 

min F(a,/3,7)< min F {(j> a {a), <f>e(b), (j>^{y)) . 

{(a,/3,7)e6} {(a,6,y)GCM 
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Combining this with equation d72l ), we get 



min F(a,(3,j) = min F (0«(a), <j>p(b), 7 (y)) . (73) 
{(a,/9,7)eO} {KMeCi} 



Therefore, we have an equivalent optimization problem to that presented in equation ( 1591 ), but with less number of 
variables, i.e., d(r) can be equivalently written as 

d(r)= min , f ( 74 > 

When (a + 6) = n, the objective function has a property which we state now and will be helpful to solve the 
minimization problem in the next section. 

Claim 1: F (<fi a {a), <fip(n — a), </> 7 (y)) is monotonically decreasing with a. 

Proof: It can be shown using equations d66b-d68b that when (a + b) = n we have 

(a* + Pj) = 1, V(* + j) = (n + 1) and (a, + fc) < 1, V(i + j) < n. 

Using these relations in the expression for F ya, /3, 7^, after some algebra we get 

p t 
F(&J,^j {m + n + 1 - 2i)&i + 53 (m + k + 1 - 21)% 
»=i ;=i 
p 



=^(m + n + 1 - 2i) f 1 - (a - i + 1) + j + ^(m + + 1 - 2/)^. 

i=l Z=l 

The above expression is clearly independent of 6 and monotonically decreasing with a. ■ 
Step 3: In this final step, we determine the minimum of F(.) on 0\ and establish the theorem. Depending on the 
value of y the set of feasible (a, b) pairs takes on different shapes as shown in the following figures. For example, 
when njzjpj < y < r^zp, tne feasible set of (a, b) pairs is the trapezoid ABDE shown in Figure |8(b)| 
For any given value of y the following observations will help us solve the problem: 

• The optimal (a, b) pair always lies on the boundary, because the objective function is monotonically decreasing 
with both a and b. 

• By the same argument the optimal point on the line segment AB is B. 

• The optimal point on the line segment BC is C. Because, by Claim Q] when (a + b) = n, the objective function 
is independent of b and monotonically decreasing with a. 

Thus for a given y, the objective function has to be minimized on the line segment CD, BD or DF. In what 
follows, we treat each of these cases individually: 

1) When IZi = (r, j^zpj , the optimal (a, b) lies on DC and the diversity order is given by 



2) When K 2 



qr qr 
(n—r) ' (q—r) 



d*{r)= min F (cf) a (a), 4>p(b), (j)j(y)) . 

{»£Ri, o<b< v(n - r) } 

, the optimal (a, b) lies on BD and the diversity order is given by 

d*(r) = min F (</> a (a), <f>p{b), 7 (j/)) . 

{y£K 2 , 0<b<q} 
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D 



k 




\fa + b) = 


n 








+ 6(1 - r -)=r 


b = q 





A u 

( a)7e 1 = {r< !/ <^}. 




FA 5 

(C) ^3 = 1^ 



D 



\ (a + 6) = 






6 = g 







W ^ = {(^< y < I ^ T } 



Fig. 8. Sets of feasible (a, fc) tuples for different range of y. 



3) When 7?-3 = (jjpzpj,t , the optimal (a, 6) lies on DF and the diversity order is given by 

tf(r)= min F (0 Q (a), (j>p(b), (j>j(y)) . 

where a + 6 fl — ^ J = r in all of the above cases. Finally, combining the different cases we get 



d(r) = min min F 

l<i<3 {yEKi, b£B z (y)} 



r-b[l--) ),^(6),0 7 (y) 



where B\{y) 



0, 



y(n-r) 



; B 2 (2/) = [0,g]; B 3 (y) 



0. 



ry 



(y-r) 



Appendix D 

Proof of Theorem[5] The DMT on a (n, 1, n) relay channel 

We consider the case where n = m > 2 and k = 1. Combining it with Example Q] the proof of the theorem will 
be complete. From Theorem |4] we know that when r > t = 1, d*(r) = d„ „(r). So, let us consider the case when 



r < 1. Since the optimal solution always he on the line a + b [1 — = r, r < 1 implies that a < 1. Using this 
in the definitions of 0,s, we get 

G(a, 6, y) =F ((/) a (a), <j>p(b), <Ay(</)) , 

=n(n - 1) + 2?i(l - a) + n(l - 6) + - y) - n + (a + b - 1) + . 
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We know from Theorem [4] that the above objective function has to be minimized over three different sets of (a, b) 



pairs. For r < 1, 1Z\ = <£>, so we consider y € 7?-2 



n—r ' 1— r 



We know from Figure |8(b)| that the optimal 



solution lie on the line segment BD and since the objective function is also linear in a and b the optimal solution 
is one of the extreme points depending on the slope of the line 

+ 6(1-- 

V y 

Since 

1 



1 - - < r V y £K 2 , 

y) 2 

the objective function attains its minimum at B, where b* = 1 and a* = r — ^1 — . Putting this into the objective 
function we have 

G(a*, b*, y) = (n - l) 2 + (2n - 1)(1 - r) + (2n - 1) (l - + n(l - y). (75) 
The above function is convex in y and the infimum is attained at y = r which is given as 

d (TZ 2 ) = (n - l) 2 + (3n - 1)(1 - r), < r < 1. (76) 

Next we consider the case when y G 7^3 = (■jrf > 1 ■ This set is non-empty only for r < | and the optimal 

and 



point lie on the line segment DF in Figure [8(c)| Dividing the set IZ3 further into two subsets IZ31 = [jz^:, 2 
7^-32 = (2r, 1] we see that ^1 — < | when y 6 7?-3i and ^1 — ^ J > | when y G 7^32. The objective function 
attains minimum value at point F when j/ G 72-31 and at point D when y € 7^.32 and given by 

for 7/ G 7^3i ; 



G(a*,b*,y) 



v 

y—r 



n(n - 1) + 2n(l - r) + (1 - y), for for y G 72 3 2, [v (a*,6*) = (r,0)] 
Now, optimizing this function in the corresponding sets of y we get 



G{a\b\y*) 

which in turn imply 



n 2 + 2n(l - 2r), [v y* = 2r, for y e TZ 31 ] 
n{n - 1) + 2n(l - r), [•.■ y* = 1, for y G ft 32 ] • 



d (7e 3 ) = n(n - 1) + 2n(l - r), for < r < i. (77) 
Now, combining equations d76l l and (1771 we get 

d(r) = min{d(7e 2 ),d(7e 3 )} = (n - l) 2 + (3n - 1)(1 - r), for < r < 1. (78) 
Finally, putting this in Theorem [4] and combining the result with Example Q] we get 

^T^ , , < r < 1 and n = 1; 



dl{r) = \ ( n -l) 2 + (3n-l)(l-r), < r < 1 and n > 2; 

dn,n(r), 1 < r < n and n > 2. 
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Appendix E 

Proof of Theorem[6J The DMT on a (1, k, 1) relay channel 
On a (1, k, 1) channel we have a,b,y < 1. Putting this in the definitions of 0iS from Theorem|4] we get 



G(a, b,y)=F (M"),Mb),Mv)) = ( k + - r) + (fc + 1)6 ( 1 - - ) - AA + k(l - y). 



Since 



1 



the objective function attains its minimum at b* = m&x{Bi(y)} = y (^r-)- Putting this into the objective function 
we have 

(k + iy 



G(a*,b*,y) = b* + k(l - y) = k + y 



1 



Clearly, the largest and smallest feasible value of y in IZi — y~, -^—^ 
coefficient of y is negative and non-negative, respectively. That is, the optimal y is given as 

r, for < r < 



(79) 



minimizes the above function when the 



l-r 

1. 



for 



(i+k) 



< r < 



for r > A. 



Putting this solution in equation ( |79"I ) we get 

d{Ki) -- 



(fc + l)(l-r), for0<r<±; 



1 + k 



for 



< r < 



for | < r < 1. 



(80) 



This set is non-empty only 



Since for in = n = 1, 72-2 = 3>, next we consider the case when y £ 7Z 3 = yj^, 1 
for r < i and the optimal point lies on the line segment DF in Figure |8(c)| Dividing the set 1Z 3 further into two 
subsets namely IZ31 = (^jz^:, (k + l)r and 7^32 = ((k + l)r, 1 , we see that ^1 — < ^ when y £ 1Z 3 \ and 
> i when y £ ^32 . The objective function attains minimum value at point F, where b = max B 3 (y) 



1 - 

when y £ 1Z 3 \ and at point D, where b = minify) when y £ 1Z 3 2 and given by 



G{a*,b*,y) 



y 

y—r 



for y £ Tl 31 ; 



yr 

y—r 



l + k[2 

(k + 1)(1 - r) + fe(l - y), for for y £ TZ 32 . [b* = 0] . 
Both these functions are minimized by the maximum value of y in their corresponding range which are 

(k + l)r, when y £ 1Z a i n {r < ^^y} ; 

when y £ TZ 3 i n |r > (T^iy } ; 
when y G 7e 32 n |r < (TTiy} > 



J7 = 



1. 
1- 



which in turn gives us 



d(K 3 ) 




1)(1 - r), for < r < 



for 



Jk+Tj 



<r< 



(81) 
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Combining equations (f80b and ( |8lT i we get 

d(r) =min{d(n 1 ),d(TZ s )}, 

(fc + l)(l-r), for0<r<±; 
l + fc(i^), for^<r<±; 
(i==) , for i < r < 1. 

Finally, by Theorem [4] we have c^fr) = d(r). 

Appendix F 

Proof of Theorem[7J The DMT of the (2, fc, 2) relay channel 

Instead of optimizing the objective function over all possible values of (a, b), we restrict the minimization over 
a set on which a + b = n and thus obtain an upper bound. From the proof of Theorem [4] we know that the optimal 
(a, b) pair lies on the line 

a + b ( 1 - ^ = r. (82) 
Combining it with the assumption just made, it is clear that the optimal choice of a, for any feasible y, is given as 

a = 2 - y. (83) 



r 

Now, by Claim Q] we know that when a + b = n, the objective function becomes 

2 2 

^(5-2^ + ^^ + 3-207/- 

i=l i=l 

Computing the values of a and 7 from the definitions of <pi& and substituting in the above equation we get 

2 q 

d u (r) = min V (5 - 2i) ((i - a*) A 1) + + V (k + 3-2/) ((/ - y) A 1) + , 

r<y<q — ' ^ — ' 

1 ^ — 1 

= r min^(5-2i) ((«- 2 + (^") v ) A X ) + 5Z (fc + 3 ~ 2l) ((/ ~ y) A 1)+ ' (84) 
To optimize this function with respect to y, in the following we divide the values of a* and y in four different 
regions as follows 

Tl UtV = {(u - 1) < a* < u, (v-l)<y<v}, u,v = l,2. (85) 

In any of these regions the function is minimized either if y attains its maximum value, denoted as ym in that 
region or the minimum value, denoted as y m in that region. However, in a particular region, the extreme values 
depend also on the value of r. So in the following, depending on r each region will be further divided into several 
sub-regions and in each sub-region the two extreme values of y will yield two values of the objective function as 
functions of r. We will have to take the minimum among all these functions to get the desired minimum of the 
objective function. 

Region TZi\: In this region the set of feasible values of y is given by the following 



r 2r 

— <y< 



^}n{o< y <i}n{r<y}. 
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Note for < r < 1, y m = r, for < r < |, yu = ^37, f° r I — r — 1 Vm = 1 and f° r r > 1 the above set is 
empty. From the aforementioned argument, we get the minimum value of the objective function in this region by 
putting these values of y in equation ( f84T > as 

d2,a(r) + da )fc (r), < r < 1; 
d(r,^ 1 ,i) = min^ k + 3 + (jfc + 1) (^) , < r < §; (86) 



A: + 6 



< r < 1; 



Region 7^1,2- In this region, the set of feasible values of y is given by 

2r 



2-r 



2-r 



|n{l <y<2}n{r <y}. 



For I < r < 1, y rn = 1 and j/m = -^-p, for 1 < r < |, y m = and j/m = 2 and for r > | the above set is 
empty. Putting these values of ?/ in equation d84b , we get 

d(r, 75.1,2) = min < 



4 + 4(*-l) , |<r<l 



1 



4(^1), l<r 



(87) 



1 < r < 



< 



Region Ttiy. In this region the set of feasible values of y is given by 



0<y< 



n{o<t/<l}n{r<y}, 



2-r 

which is an empty set. 

Region Ti-2,2- In this region the set of feasible values of y is given by 

r 



jo < y < J n {l < y < 2} n {r < y y 



This set is empty for r < 1; for 1 < r < |, t/ m = r and yu = j^, for | < r < 2, y r „ — r and j/m = 2. Again, 
putting these values of y in equation (I841 l. we get 

da,2(r) +d2,*(r), 1 < r < 2; 
rf(r,7e 2 , 2 ) = min <( 1 + (fc - 1) , 1 < r < |; (88) 



2(2==), |<r<2; 



Finally, combining equations (I86I 1. d87l i and (l88l the theorem is proved. 



Appendix G 
Proof of Lemma[2] 

The determinant of a matrix can be written as the sum of several terms, where each term is a product of some 
of the elements of the matrix and ±1 (e.g., see Section 0.3.2 in ll28l ). In what follows, we shall first use this result 
implicitly to simplify Ai(er,£). The terms of Ai(ct,£) will be gradually approximated in such a way that when 
expanded as a sum, the exponential order of each term in the sum remains unchanged, which in turn imply that 
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the exponential order of Ai(<7,£) itself remains unchanged. Then combining it with the asymptotic expressions for 
the Vandermonde matrices Vi(£) and Vi(<r) we get the desired identity. 

Replacing x by £ and y by a in the expression for Ai in equation d45l l. we get 

d 



Ai(a,e) 



'N 2 



W2-JV1-I) 
N 2 



1 (Tn 2 

1 (JVa-JVi-1) (JVa-JVi-1) 



1 



OVj-JVi-l) 



cr 



(JV 2 -JV!-l) c - 



e "i 



CT (JV 2 -1) e 



(JVa-iVi-l) -ii. 



=T 



-(iV 2 -JVi-X) -1 
°N 2 ' ' ' °iV 2 



1 e~^~2 



Si 

e ""2 



CT (Af 2 -l) CT (W 2 -1) 1 e 



_(iV 2 -JVi-l) -1 

ctj ■ ■ ■ a x 



SiV-i 



e "(» 2 -i) 



e "i 



where 71 



- llz=i \ oi 



(JVa-JVi-l) 



Now, putting <Tj = (1 + pA(jv 2 -i+i)) 1 , Vi in the above equation we get 



= Ti 



(l + pAx)^-^- 1 ) -..(1 + pXi) 1 e-^i( 1+ " Al ) ••• e-^( 1+ " Al ) 

{l+p\ 2 )^- N ^ -..(1 + pA 2 ) 1 e-^i^+P^) ••• e-^( 1+ " A2 ) 

(1 + pAjv.)^ 2 ^ 1 " 15 •••(! + pAjyJ 1 e -««i( 1 +' ,A «2) ... e ^i(i+pA„ 2 ) 

1 (1 + pAi) •••(l + pAi)( Ar2 - Ari - 1 ) ••• e-«^ Al ) 

1 (l + pA 2 ) •••(l + pA 2 )( Ar2 - Ari - 1 ) e-^^pA.) ... e -Ci(pAa) 

1 (1 + pAjvJ •■■(l + pAjv,)^"^- 1 ) e-^^pA^) ... e - €l ( P A N2 ) 



where T\ = To rij=i e £>. We have also ignored the sign change due to the row operation since Ai is a part 
of a pdf. To simplify the determinant in the above equation we do the following column operations: d — > d — 

Ci_i(l + p\i),2 < i < (N 2 - Ni) and d -> d - de'^-^+^f* 1 , (N 2 - Nt + 1) < i < N 2 . Since all 
the eigenvalues vary exponentially with p, for asymptotic p using the approximation p£j(\i — Ai)= — p£,jXi and 
Aj — Ai= — Ai, Vi > 2 and V j, we get 

1 ■■•0 ••• 

1 -pAi (l + pA 2 ) (Ar2 - JVl - 2) pAi ••• e-b^ (1 - e-^ Al ) 



Ai =Ti 



=T 2 



1 -pAi (1 + pA A r 2 )( Ar2 - Afl - 2 )pAi ••• (1 - e -iiP^) 

1 (l + pA 2 ) ••■(l + pA 2 )( JV2 - A ' 1 - 2 ) e -««i(" A 2) ... e -ei(pA») 

1 (l + p\ N2 ) •••(! + pAiv 2 ) (iV2_JVl " 2) e-^i^ ••• e"^^) 
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where T 2 = T\(p\\)( N *- Nl -- v > (l - e~^ Al ). Proceeding in the same way we get 

JV2-JV1 / ATi \ 

Ai= I] W^-^-^nCl-e-^) Wi 

i=l \ 3=1 / 



N 2 -N! I JVi 
i=l V 3=1 



(1 + ppi) 



e -Ci(i+p/ii) 



-pfJViCi+wvj) ... e -?i(i+PA'N 1 ) 



(89) 



where /i; = AjVj-j^+j, 1 < ' < -^Vi and T2 = 11^=1 (1 + ^ 2 Wl ^- Now, denoting the determinant in 

equation d89l by Aq, we get 



A, 



(a) 



3 -€l(l+PMJVi) g-?2(l+PPJV 1 ) ... p-CiVj (1 + PPJVj ) 



g-P?l(l+Pl) g-?2(l+PPl) 



j-fWl (1 + PPl) 



e -^ e -(pfi/ 1 (w 1 +i-i)) , \ TT J^J ^i_ e -(pffMt) 



n 



(90) 



(91) 



3 = 1 t=l 

where equality (a) is obtained by rearranging both the rows and columns in the reverse order and the last equality 
follows from Lemma |2] Using this expression and replacing the values of /i;'s in equation (f89b we get 

N 2 W2-JV1 Ni Ni (N3-3) 

A 1 =[](l + pA l )- (Ar2 - JVl - 1) ((pAi)^'"^-*)) J] (e-^e-^ A ^+w))) J (1 - e -(^ 



i=l 



3=1 



3=1 »=1 



On the other hand, using equation (0 at high SNR, we have 



viconef 1 -^ [Hi 



(JVa-JVi) 



3=1 



i=l 



3=1 



(JV2-3) 



Vi(a) =det([(l + pA ( ^_ i+1) )-0'- 1 )]^t) ■ 

= ffl( 1 + ^) (1 ~ Ar2) l dct ([(i + M^-i+i))^"^]^ 



(92) 



(93) 



(94) 



I(l+pA0^ det [(l + pA,)^ 1 )]^ , 



' JV 2 



= m( i +^) (i " jV2) n^.W" 



(95) 



*>i=l 



Vi=l 
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Finally, combining equations d92|i-(l95t we have 

N 2 JVi 



n£in£r' } (i-e-^->) 



V 2 -i)AiVi 



iV 2 JVl 

=na+M^n(e. 



J = l 



JVi (iv 2 -i) 

n n 

i=i »=1 



1 _ e -(pCi A i) 



Appendix H 
Proof of Lemma[3] 

We shall simplify the term D(£, A) using a similar method as in the proof of Lemma|2] Let us start by expressing 
D(£, A) as follows 



N 2 



_ £lpAjV 2 . . . g~ £iY 2 PAiV 2 



-?ipAi 



-?N 2 pAi 



> N 2 \ 

ft A, A. 

vi=l / 



(96) 



It is well known 12811 that, for any square invertible matrix A and square matrix K the following identity holds. 

A B 



C K 



\A\\K -CA~ 1 B\ 



To simplify A we substitute A = e ? 1 ' )A «2 and use the above equation to get 

g— ?2pAjv 2 -i , . . g— £jv 2 pAjv 2 _i 



-?2pAi 





g — ClpAjV 2 -l 




_ g?lPAiV 2 




|^g-?2pAiv 2 , , _ g-?N 2 pAjv 2 j 




g-?ipAi 





Since the eigenvalues £j's and Aj's vary exponentially with SNR (p), from the ordering among themselves and 
equation © we have Aat 2 — Aj= — Aj, Vi < (A^ — 1). Using this in the above equation we get 



g — ?2pAjv 2 -l ... g — £jV 2 pAiY 2 -l 



-?ipAj\ 



-?2pAi 



-C 2 pAiv 2 -l 



3 -? 2 pAi 



g— ?N 2 pAi 
g— £at 2 pAjv 2 _i 





g — ClpAjV 2 -1 




g-SipAi 



-f2PAjV 2 . , , „ — ?N 2 pAjV 2 " 



-?]Y 2 pAi 



3 — ^ipAjv 2 -i ... g~ CipAjv 2 -i 



g-CipAi . . . g-fipAi 



(6) 



-£ipAiv 2 



iV 2 -l 



-C 2 pAn 2 _i g— CipAn 2 _i^ , , , g— £n 2 pAiv 2 -i ^ g— CipAjv 2 -i^ 

g-?2pAi H _ e -£ip\i\ . . . g-£« 2 pAi h _ g-£ipAi^ 

g — {2PAn 2 _1 , _ _ — 5« 2 pAjv 2 -i 



=g — JipAw 2 



n 



3 -6pA« 



-5 2 pAi 



-5n 2 pAi 
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where step (a) follows from the fact that (£iAj + £,2^N 2 )=£,i^i, Vi < (N% — 1) and step (6) follows from the fact 
that (£iA, — £j^i)=t;iK Vj > 2, i Proceeding in the same way we get 
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